Publishing High-Dimensional Micro Data Using Anonymization Technique

نویسندگان

  • K. Kiruthika
  • M. S Kavitha
  • S. Gayathiri
چکیده

Now a day’s society is experiencing very good growth in the count and variety of data collections having person-specific information as network connectivity, computer technology & disk storage space become increasingly affordable. Large databases is in use today’s society. The large amount of data available means that it is helpful to learn lot of individual information from public data. While doing, the privacy of the data should be persevered. The personal data may be misused, for various purposes. In order to improve these concerns, a number of techniques have newly been proposed to do the data mining tasks in a privacypreserving manner. So, to preserve the privacy of the data, privacy preserving methods are introduced and one of the methods is ‘anonymization’, in which a record is anonymized by using different anonymization techniques. Privacy preserving in data publishing is most important research area in data security area. Privacy-preserving data publishing provides methods and tools for sharing useful information while preserving data privacy and also analyzed many methods used for data privacy and their reward and short comes are monitored very well. Many techniques have been designed for micro data publishing with privacy preserving, such as generalization and bucketization. Several works resulted that generalization loses some amount of information specifically for high dimensional data. So it’s not efficient for high dimensional data for privacy preserving microdata publishing. In Bucketization, it does not prevents membership exposes and also does not suitable for data that do not have a clear separation between Quasi-Identifying attributes and Sensitive attributes of data. In our paper, an efficient method for data anonymization known as slicing and k-anonymity is introduced in which data can be partitioned both vertically(column) and horizontally(row). Advantage of slicing is that it works on high-dimensional data. Also, slicing preserves better data utility and privacy than any other method.An efficient algorithm is developed for computing sliced data that follow l-diversity requirement. A random permutation is done to randomly order a set of objects, that is, to permutate the data forapplying more privacy. Data slicing gives better utility than generalization and also no necessary for clear separation between Quasi-identifying and sensitive attributes. Experimental results using hospital dataaset confirm that data slicing provides data utility than generalization and more effectual than bucketization including sensitive attributes. Experiments we use the hospital patient datasets propose that our approach achieves better utility and also efficiency than any other existing and baseline algorithms while satisfying of proposed security work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Approach for Secure Data Publishing in Mining

More than a few anonymization techniques, such as simplification and bucketization, have been deliberate for privacy protecting micro data publishing. Current work has shown that generalization loses substantial quantity of in sequence, particularly for high-dimensional data. Bucketization, on the other offer, does not put off membership revelation and does not apply for data that do not have a...

متن کامل

A Novel Approach to Privacy Preserving Data Publishing Using Slicing Technique

Several anonymization techniques, such as generalization and bucketization, have been designed for privacy preserving microdata publishing. Recent work has shown that generalization loses considerable amount of information, especially for high-dimensional data. Bucketization, on the other hand, does not prevent membership disclosure and does not apply for data that do not have a clear separatio...

متن کامل

Multi-Privacy Collaborative Data publishing with Efficient Anonymization Techniques

Privacy-preserving in collaborative data publishing provides methods and tools for publishing the data while protecting the sensitive information in the data set. The success of data mining in privacy relies on the information sharing and quality of data in a distributed environment. Several anonymization techniques have been proposed such as bucketization, generalization which does not prevent...

متن کامل

ارایه یک روش جدید انتشار داده‌ها با حفظ محرمانگی با هدف بهبود دقّت طبقه‌‌بندی روی داده‌های گمنام

Data collection and storage has been facilitated by the growth in electronic services, and has led to recording vast amounts of personal information in public and private organizations databases. These records often include sensitive personal information (such as income and diseases) and must be covered from others access. But in some cases, mining the data and extraction of knowledge from thes...

متن کامل

Data attribute security and privacy in Collaborative distributed database Publishing

In this era, there are need to secure data in distributed database system. For collaborative data publishing some anonymization techniques are available such as generalization and bucketization. We consider the attack can call as “insider attack” by colluding data providers who may use their own records to infer others records. To protect our database from these types of attacks we used slicing...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016